Introduction to the R package TDA
Abstract
We present a short tutorial and introduction to using the R package TDA, which provides tools for Topological Data Analysis. In particular, it includes implementations of functions that, given some data, provide topological information about the underlying space, such as the distance function, the distance to a measure, the kNN density estimator, the kernel density estimator, and the kernel distance. The salient topological features of the sublevel sets (or superlevel sets) of these functions can be quantified with persistent homology. We provide an R interface for the efficient algorithms of the C++ libraries GUDHI, Dionysus, and PHAT, including a function for the persistent homology of the Rips filtration, and one for the persistent homology of sublevel sets (or superlevel sets) of arbitrary functions evaluated over a grid of points. The significance of the features in the resulting persistence diagrams can be analyzed with functions that implement the methods discussed in Fasy, Lecci, Rinaldo, Wasserman, Balakrishnan, and Singh (2014), Chazal, Fasy, Lecci, Rinaldo, and Wasserman (2014c) and Chazal, Fasy, Lecci, Michel, Rinaldo, and Wasserman (2014a). The R package TDA also includes the implementation of an algorithm for density clustering, which allows us to identify the spatial organization of the probability mass associated with a density function and visualize it by means of a dendrogram, the cluster tree.
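To make two of the functions named in the abstract concrete, here is a minimal sketch of the distance function and an empirical distance to a measure, each evaluated at query points or over a grid. It is written in plain Python rather than R and does not use or reproduce the package's API. The names `distance_function`, `distance_to_measure`, `evaluate_on_grid`, the smoothing parameter `m0`, and the sample data are all hypothetical, and the distance to a measure is assumed to take its common empirical form with power 2 (root mean squared distance to the k = ceil(m0 * n) nearest sample points).

```python
import math

def distance_function(points, x):
    """Distance from the query point x to its nearest sample point."""
    return min(math.dist(p, x) for p in points)

def distance_to_measure(points, x, m0):
    """Empirical distance to a measure (assumed form, power 2):
    root mean squared distance from x to its k nearest sample points,
    where k = ceil(m0 * n) and n is the sample size."""
    k = max(1, math.ceil(m0 * len(points)))
    nearest_sq = sorted(math.dist(p, x) ** 2 for p in points)[:k]
    return math.sqrt(sum(nearest_sq) / k)

def evaluate_on_grid(fn, xs, ys):
    """Evaluate fn at every (x, y) grid point; rows follow ys, columns follow xs."""
    return [[fn((x, y)) for x in xs] for y in ys]

# Hypothetical sample: three clustered points and one far outlier.
sample = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (10.0, 10.0)]

# At the outlier the distance function is zero, while the distance to a
# measure stays large because it averages over the k = 2 nearest points.
d_outlier = distance_function(sample, (10.0, 10.0))
dtm_outlier = distance_to_measure(sample, (10.0, 10.0), m0=0.5)

# Distance function over a 2 x 2 grid of points.
grid = evaluate_on_grid(lambda x: distance_function(sample, x),
                        xs=[0.0, 1.0], ys=[0.0, 1.0])
```

The contrast at the outlier illustrates why a smoothed function such as the distance to a measure can be preferable when the data contain noise: a single stray point creates a zero of the distance function but barely changes the averaged quantity near the bulk of the sample.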
Journal title:
- CoRR
Volume abs/1411.1830
Publication date 2014